Identifying Speakers Using Their Emotion Cues
نویسنده
چکیده
This paper addresses the formulation of a new speaker identification approach which employs knowledge of emotional content of speaker information. Our proposed approach in this work is based on a two-stage recognizer that combines and integrates both emotion recognizer and speaker recognizer into one recognizer. The proposed approach employs both Hidden Markov Models (HMMs) and Suprasegmental Hidden Markov Models (SPHMMs) as classifiers. In the experiments, six emotions are considered including neutral, angry, sad, happy, disgust and fear. Our results show that average speaker identification performance based on the proposed two-stage recognizer is 79.92% with a significant improvement over a one-stage recognizer with an identification performance of 71.58%. The results obtained based on the proposed approach are close to those achieved in subjective evaluation by human listeners.
منابع مشابه
Production of English Lexical Stress by Persian EFL Learners
This study examines the phonetic properties of lexical stress in English produced by Persian speakers learning English as a foreign language. The four most reliable phonetic correlates of English lexical stress, namely fundamental frequency, duration, intensity, and vowel quality were measured across Persian speakers’ production of the stressed and unstressed syllables of five English disyllabi...
متن کاملTracing vocal emotion expression through the speech chain: Do listeners perceive what speakers feel?
This study examines whether vocal cues can be used to reliably infer speaker happiness. Two-hundred speakers were asked to perform a simple referential communication task and to rate their current emotional state. A range of vocal cues was traced through the speech chain using path analysis. The results indicate that reported happiness of the speakers and perceived happiness of the listeners we...
متن کاملPerception by Japanese, Korean and American listeners to a Korean speaker’s recollection of past emotional events: Some acoustic cues
Acoustic and perceptual analyses of spontaneous Korean were made of a Korean woman recalling past emotional events in her life. A subset of 20 single word utterances and 20 isolated vowels were presented to Japanese, American and Korean listeners who were asked to (1) rate the intensity of the perceived emotion and (2) identify the perceived emotion. Listeners could rate intensity and identify ...
متن کاملSound frequency affects speech emotion perception: results from congenital amusia
Congenital amusics, or "tone-deaf" individuals, show difficulty in perceiving and producing small pitch differences. While amusia has marked effects on music perception, its impact on speech perception is less clear. Here we test the hypothesis that individual differences in pitch perception affect judgment of emotion in speech, by applying low-pass filters to spoken statements of emotional spe...
متن کاملEmotion Detection in Task-oriented Spoken Dialogs
Detecting emotions in the context of automated call center services can be helpful for following the evolution of the human-computer dialogs, enabling dynamic modification of the dialog strategies and influencing the final outcome. The emotion detection work reported here is a part of larger study aiming to model user behavior in real interactions. We make use of a corpus of real agent-client s...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- I. J. Speech Technology
دوره 14 شماره
صفحات -
تاریخ انتشار 2011